منابع مشابه
16-899C ACRL Tetris Reinforcement Learner
Our approach to this problem was to use reinforcement learning with a function approximator to approximate the state value function [RSS98]. In our case, a +1 reward was given for every completed line, so that the value function would encode the long-term number of lines that is going to be completed by the algorithm. In order to achieve this, we extract features from the game state, and use gr...
متن کاملOptimal Algorithms, International Symposium, Varna, Bulgaria, May 29 - June 2, 1989, Proceedings
Imagine that you get such certain awesome experience and knowledge by only reading a book. How can? It seems to be greater when a book can be the best thing to discover. Books now will appear in printed and soft file collection. One of them is this book optimal algorithms international symposium varna bulgaria may 29 june 2 1989 proceedings. It is so usual with the printed books. However, many ...
متن کاملSymposium on risk factors and mechanisms in carcinogenesis. June 26-28, 1989, Wurzburg, West Germany. Proceedings.
The pharmacokinetics of the hypoxic radio-sensitizer nimorazole were studied in 19 individuals after single oral doses of between 0.5-3.5 g. HPLC measurements showed, after a rapid absorption, a linear relationship between peak plasma concentration and given dose. Mean elimination half life was 3.1 h. A tendency to a dose-dependent variation in the apparent volume of distribution, total body cl...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: College & Research Libraries News
سال: 1989
ISSN: 2150-6698,0099-0086
DOI: 10.5860/crln.50.8.696